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. Abstract. In cellular environment, confinement and macromulecular crowding play 

C/3 1 an important role on thermal stability and folding kinetics of a protein. We have 

resorted to a generalized version of the Wako-Saito-Muhoz-Eaton model for protein 
folding to study the behavior of six different protein structures confined between two 



C/2 

(— I ■ walls. Changing the distance 2R between the walls, we found, in accordance with 

previous studies, two confinement regimes: starting from large R and decreasing R, 
confinement first enhances the stability of the folded state as long as this is compact 
CNJ ■ and until a given value of R; then a further decrease of R leads to a decrease of folding 

^ . temperature and folding rate. We found that in the low confinement regime both 

unfolding temperatures and logarithm of folding rates scale as R~^ where 7 values lie 
in between 1.42 and 2.35. 
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1. Introduction 

In the past the majority of experiments on protein folding have been carried out in 
diluted solutions but in the last two decades it has become clear that these experiments 
do not take into account two issues which arise in vivo and whose relevance on thermal 
stability and equilibrium rates is not negligible. Namely, crowding and confinement 
[H [21 El S]. Crowding refers to the fact that about 30% of cells internal volume is 
occupied by macromolecules such as lipids, carbohydrates and proteins themselves [1]. 
This fraction could even reach 40% in E. Coli [5]. Confinement is merely a limitation 
in the volume available to the polypeptide chain as naturally occurs in the exit tunnel 
of ribosomes or in the chaperonin cavity. 

Studying protein folding properties in a crowded environment is experimentally 
possible simply by adding high concentrations of macromolecules to solutions, but this 
approach has problems because of specific interactions which arise between proteins and 
crowding agents and because crowding promotes protein-protein aggregation [1] . Based 
on the idea that the main effect of crowding is the reduction of volume available to the 
protein due to steric constraints, theoretical studies and simulations have shown that 
crowding may be quantitatively mapped onto confinement as long as crowding agents 
are modelled as hard spheres and the volume fraction occupied by them does not exceed 
10% [6]. Thanks to this mapping, experimental and theoretical studies on confinement 
may give many hints also for crowding effects. However the above conditions often does 
not hold in the cell interior because of too high concentration of agents or presence of 
macromolecules-protein attractive interaction. In addition, gradients in macromolecule 
concentrations may exist [7j and, from a more general point of view, crowding is dynamic 
in nature whereas confinement is static. Thus, the mapping is not close enough to draw 
a completely satisfactory analogy between crowding and confinement. 

An experimental procedure to mimic the effects of confinement, is the encapsulation 
of proteins within pores of sihca gels [H [9] or glasses [10] or polyacrylamide gels [11] . 
These experiments reported, for most of the considered proteins, an increase in thermal 
stability when they are confined into nanopores. Melting temperature (Tf) shift is even 
dramatic in the cases of a-lactalbumin and RNase A, being as large as about 30 K [8|[T0]. 
On the contrary, recent experiments suggested that crowding influence on stability is 
modest [71[l2]. 

The commonly accepted reason for the increase in stability is the change in 
conformational entropy induced by confinement [131 El IISl [HI [HI [18] . Encapsulating 
the protein in a given volume disallows the most expanded configurations of the 
denatured state ensemble and so indirectly favours more compact structures and, among 
them, the folded state. The same argument explains also why confinement should lead 
to an increase in folding rates fcf as long as the nanopore size is large enough to contain 
the folded state and to permit chain reconfigurations around it [T3 | [T^ [T5| [T6l [T71 [T8l [T9] . 

From polymer physics we know that a polymer confined between two (sufficiently 
close) inert hard walls, behaves like a pancake with the radius of gyration (parallel to 
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the walls) that scales as a power of the number of monomers [201 [2T1 Furthermore, 
when it is confined within a cage with repulsive walls, its free energy follows a simple 
power law dependence on the size of the cage R [201 El]. Then, as shown by Takagi et al. 
[T7] . folding temperatures and rates should follow the scaling laws ATf ~ A In /cf ~ . 

Literature reports many values for the exponent 7: for an ideal gaussian chain 
confined between two walls [dc = 1), in a cylinder [dc = 2) or in a spherical cavity 
{dc = 3), 7 = 2 while for an excluded volume chain 7 = 5/3 for dc = 1,2 and 
7 = 15/4 for dc = 3. Using a Go-model a-carbon representation of proteins and 
Langevin simulations in a cylindrical cage, Takagi et al. [17] found 7 = 3.25 ± 0.09. 
Best and Mittal [18] simulated confinement of protein G and a 3-helix bundle in different 
geometries and reported that for dc = 1,2 both values 7 = 2 and 7 = 5/3 are a good 
estimate of the behavior of the two proteins, but they also remarked the fact that it is 
hard to distinguish which value fits best the simulations because least square fitting of 
power laws can produce biased estimates of parameters for small samples. For spherical 
confinement the same authors reported a behavior which is stronger than 7 = 2 but 
much weaker than expected behavior for the excluded volume chain (7 = 15/4). 

In the present work we confine a simple Ising-like model (WSME model) originally 
proposed by Wako and Saito in 1978 and later reconsidered by Munoz and Eaton 
[231 [211 [25l [261 [27] . Equilibrium thermodynamics of the model can be solved exactly [28] . 
The cluster variation method is exact for this model [29] and it successfully describes 
the kinetics of protein folding [301 1311 [321 [33] • More recently it has been proposed 
a generalized version of the model that permits to reproduce the general features of 
mechanical unfolding [311 [M] and, through Monte Carlo simulations, to obtain for some 
already widely studied proteins and RNA fragments, unfolding pathways which are 
consistent with results of experiments and/or of simulations made with more detailed 
models [361 [Ml [39]. The model has also been used with success to study folding 
equilibrium and kinetics and to mimic mutations of a small ankyrin repeat protein [40j . 

We use the confined WSME model to study thermodynamics and kinetics of three 
ideal structures and three simple proteins in confining conditions. The ideal structures 
are a 10 residues ideal a-helix, a 2-stranded and a 3-stranded ideal /3-sheets each 
with 7 residues per strand. Real structures are a 3-helix bundle, protein G and its C- 
terminal /3-hairpin. The paper is organized as follows: in Sec. 2 we describe the WSME 
model and its confinement. Sec. 3.1 focuses on the confinement-induced changes of 
thermodynamic stability for the different proteins while Sec. 3.2 deals with kinetics and 
the expected increase in folding rates. Some conclusions are drawn in Sec. 4. 

2. The model 

WSME model is a Go-like model in which a given residues protein is described 
by a sequence of binary variables m^, whose value is 1 if k-th residue is in the 
native configuration and otherwise. Two residues interact only if they and all residues 
between them are native and only if they are in contact in the native structure, i.e. they 



4 



have at least a pair of atoms which are closer than the threshold length of 4 A in the 
native structure. If residues i and j are in contact in the native structure we associate to 
them a negative energy —Sij (defined as in [27]) and a contact matrix element Aij = 1. 
If the two residues are not in contact Aij = 0. When the molecule is pulled at its ends 
by a constant force /, the Hamiltonian reads: 

N-l N j 

H ({mfc}, {aij}) = -^Yl ^'^^^^ Y[mk- fL (1) 

i=l j=i+l k=i 

where L = L {{nik}, {cij}) is the end-to-end length of the protein and {aij} is a set 
of new binary variables that will soon be defined and in which the greater entropy of 
non-native states is encoded. 

Here and in the following we define a native stretch from residue i to residue j as 
a sequence of native residues delimited by the two non-native residues i and j. The 
end-to-end length L is the sum of the native stretches lengths lij multiplied by a sign 
+1 or —1 (the binary variable aij) if the stretch is parallel or antiparallel to the direction 
of the force. The binary variable aij thus represents the direction of the stretch from 
z-th to j-th residue. Using the quantity Sij = (1 — mi)mi^i . . . mj_i(l — rrij), which is 
equal to 1 if the sequence of residues from i to j is a native stretch and is otherwise, 
and setting the boundary conditions mo = m/yr+i = 0, the length L is defined as: 

TV 7V+1 

L ({m4, {aij}) = ^Y1 ^ij^ij^ij (2) 

i=0 j=i+l 

The set of all possible lengths {lij} is obtained directly from the three dimensional 
structure deposited in the Protein Data Bank (pdb) as the distances between the various 
pairs of central carbon atoms {Ca^yCaj}- Besides kj, other two lengths associated to 
the stretch from the z-th to the j-th residue are important for what follows. These are 
the maximum p^^^ and the minimum p™™ among the distances between Cq. and the 
projections of each 0^^ {i ^ k < j) on the straight line from C^^ to C^^. . Note that, as 
shown in figure [U (axis x2), p™^"" > lij and p™™ < 0. 

The constrained zero-force partition function 

Z{L; / = 0) = 5^ 5^ 5 (L - L({m,}, {a,,})) e-/^^({™'=W--}^/=o) (3) 

{■mk} {cTij} 

can be recursively calculated building up the protein residue by residue and evaluating 
at each step the partition function Zn{L), where n is the number of residues achieved at 
that step (see appendix of |35] for detailed calculations). 

ai^{L) = e^^'" [zi-2{L - /i_i,„+i) + Zi_2{L + h-i,n+i)] 

(4) 
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Figure 1. Sketch of a configuration with residue mi_i = 0. Axis x\ shows relevant 
lengths of entire molecule. Axis x1 shows relevant lengths of native stretch from 
(i — l)-th to (n + l)-th residues. 



Where Xin = Y.l=l 



n 



'r=k+l 



Ekr^kr IS minus the energy of the native stretch from {i — 1) 



th to {n + l)-th residue and the initial conditions are z_i{L) = 1 for L = and 
Z-i{L) = for L ^ 0. The goal of the recursive scheme is the constrained partition 
function Z{L;f = 0) which corresponds to zj^{L). The absolute value of the possible 
end-to-end lengths of a protein cannot be greater than Lmax = 'l2iLoh,i+i, which 
corresponds to the length of the molecule in the completely unfolded, fully extended 
configuration. Thus, because of finite resolution of amino acids coordinates in the pdb 
file (which is 10~^ A), L belongs to a finite set of values in the range [— Lmax, -^^max]- 

2.1. Confinement of WSME model 

Consider again the recursive scheme of (|4]) and set the starting point of the molecule 
in the middle of the cage. In order to confine the protein into a cage of size 2R with 
perfectly repulsive walls, when adding a native stretch from (z — l)-th to {n + l)-th 
residues (which are respectively at the distances Lj_2 and L„ from the N-terminus), 
one has to require that every residue of this stretch lie inside the cage. This issue may 
be solved by considering also the lengths p^i ^+i of the native stretch (see 

axis xl of figure [1]) and inserting appropriate step functions in the recursive scheme: 



<(L) = e^^^" X 

X [Zi-2{L — /j_i^„+i)^(i? — L + li_i^n+l — 'PT-l,n+l)^i.R ~ U~l,n+1 + L + + 
+ Zi-2{L + k.i,n+i)e{R + L + - - k^l,n+l -L+ ptln+l)] 



(5) 



where 9 is the Heaviside step function: 
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Translational freedom must also be taken into account. To this end, for a given 
configuration, instead of considering simply the end-to-end length, it would be better 
to consider as the relevant length the distance between the two farthest residues of that 
configuration. We call it the configuration effective length. Fixing in the center of the 
cage the N-terminus excludes from the partition functions Zn{L) the contribution of 
some of the configurations which have an effective length shorter than 2R (for example 
in fig. 12^ configuration al has an effective length shorter than configuration a2 but 
the former is forbidden while the latter is allowed). Thus, to take into account all the 
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Figure 2. Three different configurations which would give a contribution to the 
partition function constrained at length L without any cage. With cage a only 
configurations 2 and 3 contribute. In b only configurations 1 and 3 contribute. 



configurations with an effective length shorter than the cage size, the partition function 
has to be computed for different positions of the cage relative to the N-terminus. The 
final partition function will be the sum of various partition functions at different cage 
positions. Note that some configurations will appear many times in such a scheme (for 
example state a3 of fig. [2^) as a consequence of their greater translational freedom. 

To obtain the final partition function one has to repeat this procedure considering 
all the possible positions of the cage relative to the N-terminus, i.e. to start with the 
range [—2R, 0] and to move the cage with a step AR equal to the resolution of the 
{lij} until the final range [0,2i?] is reached. To speed up computations we rounded 
the lengths to a resolution of 10~^ A . For the 3-helix bundle we checked that this 
assumption does not modify the results through a comparison with results obtained at 
the resolution of 10~^ A. 



3. Results and discussion 



3.1. Equilibrium 

In this study we considered six different structures. Three are real structures: a 3-helix 
bundle (pdb code IPRB), protein G (pdb code 2GB1) and its final hairpin. The other 
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three structures are an ideal a-helix of ten residues (radius 2.3 A, pitch 5.4 A, eij = 1 
if j = z + 4 and e^j = otherwise), a 2-stranded and a 3-stranded antiparallel /3- 
sheets with 7 residues in each strand (the 3-stranded sheet is drawn in figure [3]). In the 

f ■ • • • • • 

I I I I I I 
I I I I I I 
I I I I I I 
• ■ • • • ■ ( I 

I I I I I I 
I I I I I I 
I I I I I I 



Figure 3. Ideal antiparallel /3-sheet with 3 strands. Distance between two consecutive 
residues is 3.8 A. Dashed lines represent active contacts. For them Eij = 1, while = 
in other cases. 

following, code 'aOlO' refers to the ideal a-helix, 'b207' and 'b307' to the two /3-sheets 
which have respectively 2 and 3 strands and 'GBlh' refers to the final hairpin of protein 
G. 

To study the equilibrium response to confinement of the six structures, we 
computed, at different cage sizes R, thermodynamic quantities as the Helmholtz free 
energy, the specific heat and the average fraction of native residues. For each structure 
we varied the distance 2R between the walls, in a range from about the mininum effective 
length of the completely unfolded state to twice the maximum length of the completely 
unfolded state, i.e. from 4 A (the distance between two subsequent amino acids is 
about 3.8 A) to 2Lmax- 

If we denote with Lj^ gg- the effective length of the native state (values are reported 
in table [1]), we may naively distinguish between two different confinement regimes: 
(i) one, for 2R > L-^ qQ , which disallows the more expanded conformations of the 
non-native basin but not the folded state, and (ii) the strong confinement regime, for 
2R < 

-^N eff.) which forbids also the fully native state. 
Table [1] also shows the effective length of the unfolded state . This is obtained 

through a Monte Carlo simulation at the unfolding temperature as the average effective 
length over the configurations belonging to the unfolded basin. Details about Monte 
Carlo moves will be given in the next section. 

Table 1. Native state end-to-end length (Ln): effective length of the native state 
(-^N off ) ' maximum length of the fully unfolded state (Lmax) and effective length of 
the unfolded state (Ly ^g- ) for the six different structures. 





aOlO 


b207 


b307 


GBlh 


IPRB 


2GB1 


(A) 


14.3 


3.8 


24.0 


6.5 


40.0 


27.8 


-^Ncff. (A) 


14.3 


3.8 


24.0 


6.6 


40.0 


29.1 


^max (A) 


34.2 


49.4 


76.0 


63.8 


201 


212 


Lv eff. (A) 


14.1 


21.8 


27.3 


24.1 


40.5 


46.3 



Since, without confinement, for a given set of binary variables {rrik}, the model 



8 



admits 2^»=i'^^~™») configurations and this number grows exponentially with the amount 
of non-native residues, we may expect that confinement in a cage of size R, with 
Lma^ > 2-R > off. ! gives a reduction of conformational entropy which affects more 
the non-native basin. Besides, one has to consider translational freedom whose role is 
to further stabilize the most compact configurations irrespectively of the fact that they 
belong or not to the native basin. Thus a structure with > Ly ^g- , as in the case 

of the ideal a-helix, does not undergo any stabilization of the folded state. Figure H] 
shows the free energy landscapes for the three real structures at different confinement 
sizes (for a better comparison the free energy of the completely folded state has always 
been set to zero). For the final hairpin of protein G confinement increases the free 
energy of both the native and non-native basin: both native and non-native basin are 
destabilized but the latter is more affected. On the contrary, for the 3-helix bundle 
both native and non-native basins are stabilized, with a slightly greater stabilization 
with confinement for the native state. Finally, for protein G, only the non-native basin 
is destabilized by confinement. 




Figure 4. Free energy profile in function of the fraction of native residues M at 
various confinement radius R for the 3-hehx bundel, protein G and its final hairpin. 
Free energy of completely native state (M = 1) have been setted to zero. 



The increased stability of the native state relative to the unfolded state should 
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result in higher unfolding temperature according to (THl EZ] : 

V-^Ncff./ 

where here, and from now on, we denote with the unfolding temperature without 
confinement. For each protein, we have determined Tf as the temperature at which the 
average fraction of native residues is such that (M — Moo)/{Mo — M^o) = 0.5, where 
Moo = 1/3 is the value of M at infinite temperature and Mq f» 1 is its value at zero 
temperature. 

The ideal a-helix is destabilized by confinement already from values of R lower 
than i? = 15 A and no enhancement in the unfolding temperature could be detected 
for greater values of R. Other proteins exhibit an enhancement in their thermal stability 
to a different extent depending on their structure: the increase in unfolding temperatures 
is of few percents for the 3-stranded /3-sheet, 3-helix bundle and protein G, while for the 
two /9-hairpins Tf ~ 6.6 (ideal 2-stranded /3-sheet) and Tf ^ 2.7 (final hairpin of 
protein G). Such drastically different behavior is due to the very short effective lengths 
of native states of the two hairpins and to the limitation of the model which projects 
the positions of all residues on a single direction and loses information on the real 
three-dimensional structure. For the 3-helix bundle and for protein G, the increases in 
unfolding temperature correspond respectively to about 1.5 K and 9.3 K. 

Values i^j*^ of the cage radius for which, at equilibrium, unfolding temperature 
reaches its maximum and the extent of enhancement are reported in table O 



Table 2. Values of R for which unfolding temperature reaches its maximum (T™*^^) 
and the extent of enhancement. Values of 7 from fits to © and fit ranges. Fits in 
ranges from /2 to Lraax for 'b207' and 'GBlh' result in exponents 7'. 





b207 


b307 


GBlh 


IPRB 


2GB1 


(A) 

^f 


2 

6.55 T^o 


17 
1.013 Tj" 


3 

2.73 r|' 


25 
1.005 Tj" 


17 


7 

fit range (A) 


2.14 ±0.03 

[4,50] 


1.57 ±0.05 

[18,76] 


2.35 ±0.03 

[4,64] 


1.50 ±0.05 

[26,201] 


1.65 ±0.04 

[18,212] 


7' 

fit range (A) 


1.72 ±0.06 
[10.9,50] 




1.60 ±0.07 
[12.05,64] 







The enhancement in thermal stability can be appreciated in figure [5] where we 
reported the specific heat as a function of temperature. The top panel also shows well 
another feature of the unfolding phase transition in confined environment which is a 
decreased cooperativity with confinement [l7j. 

A fit to (E]) of unfolding temperatures as a function of R (figure [6]) yielded exponents 
7 reported in table [2l All values are in between 1.50 (3-helix bundle) and 2.35 (final 
hairpin of protein G). Remarkably, in this range we find also the theoretical values of 
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Figure 5. Specific heat Cy = ^2 3)32^ as a function of the temperature at various 
confinement radius R for protein G and its final hairpin. 



7 for an excluded volume chain confined in a slit or in a cylinder (7 = 5/3) and for a 
gaussian chain in a slit, a cylinder or a sphere (7 = 2). Furthermore, a more careful 
analysis of data in figure M suggested us to fit, in the case of the /3-hairpins, also in a 
more limited range of R values going from Ly cff./2 to Lmax (figure [7]). In this very low 
confinement regime 7 = 1.72 for the ideal hairpin and 7 = 1.6 for the final hairpin of 
protein G. 
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2R/L^ 



Figure 6. Shift in unfolding temperature as a function of confining cage radius R. Fits 
to ([6|) in ranges reported in table [2l The vertical lines represent the ranges spanned 
by fits. 
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Figure 7. Shift in unfolding temperature as a function of confining cage radius R 
for 'b207' and 'GBlh'. Fits to ^ in ranges /2 to imax- The vertical lines 

represent the ranges spanned by fits. 



3.2. Kinetics 

The folding kinetics have been studied by Monte Carlo (MC) simulations in which a 2- 
components ternary variable (m^, Sk) have been associated to each residue k. If = 1, 
Sfc = while if = 0, = c^j = ±1 is the direction of the native stretch from the 
k-i\i to the j-th residue. A single MC step consists in choosing a residue k with uniform 
probability among the N residues and changing (m^, Sk) variable with equal probability 
to any of its other two states. This move is alternated with a 0.1 A translation of 
the entire protein to the left or to the right with equal probability. Few remarks are 
necessary: suppose to have a native stretch from the i-th. to the j-th residue and to 
transform the variable (m^, Sk), i < k < j, from (1, 0) to (0, Sk = ±1)- The direction of 
the new native stretch from the fc-th to the j-th residue will be determined by Sk while 
the new native stretch from i to k will inherit the direction of the old one from i to j. 
If instead we move the state of k-th residue from (0, ±1) to (1, 0), two native stretches 
merge into one with direction equal to the direction of the first old native stretch. At 
each MC step confinement requirements must be checked. 

Changes in folding rates have been estimated [H] from Kramers kinetics, k^ oc 
-Dexp(— AGu'^^^'^^'/Zc^T), where AG\j^"^^^ is the free energy difference between the 
transition state and the unfolded state and D is a diffusion coefficient. Because the 
unfolded state is destabilized by confinement, the free energy barrier dividing the 
unfolded from the native state is smaller. Assuming that the free energy of the latter 
and the diffusion constant are not affected by confinement and that the free energy of 
the unfolded state grows by a term ~ T{R/ Lq)~'^ [201 [21] leads to the scaling law: 



We determined folding rates as the inverse of mean first passage times by using 
10^ folding trajectories. First passage time is defined as the time at which, starting 




(7) 
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from a random unfolded configuration, the weighted fraction of native contacts {Q = 

Et^T^E^Li+i^ii^ijnLi"^fc/E»='^^Eji+i^ii^y) catches up with the threshold 0.9, 
which ensures the protein has reached the folded state and has not got stuck in some 
intermediate. Temperature has been set to 0.9 T°. 

Table 3. Values of R for which the folding rate reaches its maximum /t^^*^^ at 
T = 0.9 and the extent of enhancement. Values of 7 from fits to ([7]) in the reported 
ranges. 

b207 b307 GBlh IPRB 2GB1 

19 23 12 19 18 

1.13 fc? 1.13 fc? 1.46 /fc? 1.50 /fc? 2.35 fc? 



1.42 ±0.20 1.53 ±0.33 1.54 ±0.11 1.71 ± 0.08 1.67 ±0.07 

[19,50] [23,76] [14,64] [22,201] [20,212] 



i?f n (A) 

j,max 



fit range (A) 



When decreasing R, folding is accelerated until a certain size Rj is reached, then 
folding rates start to decrease. Table [3] reports R^^^ values and the maximum extent 
of folding rates enhacement. For the /3-hairpins, the drastic difference between R^^^ 
and Rj'^ is likely due to the fact that for very small confining cages, even if the native 
state is not compromised, the structure is squeezed so much that chain reconfigurations 
towards the folded state become difficult. The same reason should explain the small 
differences between and Rr^ of other structures. 




Figure 8. Shift in folding rates at T = 0.9T^ as a function of confining cage radius 
R. Fits to ([7]) in ranges reported in table 121 



Table |3] also reports the 7 values obtained through a fit to ([7j) while figure [8] shows 
the folding rates behavior together with fit lines. If for the two hairpins we considered 
the very-low confinement regime, exponents 7 relative to folding rates are comparable 
with their equilibrium counterparts. 
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4. Conclusions 

We have investigated the effects of confinement on protein thermal stability and folding 
kinetics using a simple Ising-like model that we have contributed to develop and validate 
in recent years and now properly modified to include confinement of a polypeptide chain 
into a slit. To study thermal stability we have made use of the property of the model 
to be exactly solvable at equilibrium, while to study folding rates behavior we have 
resorted to Monte Carlo simulations. Notwithstanding the simplicity of the model and 
its unidimensionality, we obtained results which follow the general trend of previous 
experimental studies P HDl EH Slj and simulations [161 113 [IE]: provided the native 
state is compact, when reducing the space available to a given protein, both unfolding 
temperature Tf and folding rate /cf grow until a certain confinement size which depends 
on the protein. If the confinement size is further decreased, unfolding temperatures and 
folding rates decrease. Furthermore, our results also support the theoretical prediction 
[T3l \T7\ [T8] that enhancement depends on the confinement size R by the scaling law 

Among the six different protein structures studied in this work, one, a 10-residues 
ideal a-helix, does not show any enhancement of folding temperature and rate because 
its native state cannot be considered compact if compared to the average unfolded 
state. For the other five structures we found that exponents 7 lie in between the upper 
and lower values of 2.35 and 1.42 and that those obtained for unfolding temperatures 
from exact solutions at equilibrium are consistent with those obtained for folding rates 
enhancement by Monte Carlo simulations. 

Theoretical values of 7 (7 = 5/3 for a chain with excluded volume confined into a 
slit or a cylinder and 7 = 2 for a gaussian chain into a slit, a cylinder or a sphere) are not 
directly comparable to the results of our model, which differs from these theories both 
for the geometry (our chain is neither self-avoiding nor gaussian) and for the presence 
of specific interactions, which are neglected by these theories. Nevertheless, our results, 
both from thermodynamics and kinetics, for 7, are in the same range as the theoretical 
ones. 

Furthermore, for a 3-helix bundle and for protein G, our results are consistent with 
those obtained through a more realistic model by Best and Mittal [18] for confinement of 
the same proteins into a slit: 7 values are consistent and also the maximum enhancement 
extents of folding temperatures and folding rates are in good accordance. The two model 
also agree in the fact that protein G is more affected by confinement but there is no 
accordance on the confinement radius at which the 3-helix bundle reaches its maximum 
folding temperature and its maximum folding rate. 
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